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1. Introduction. This paper discusses certain useful relationships between non- 
linear programming algorithms and the formulation of optimization problems for sys- 
tems governed by state constraints. The simplest instance of such a problem is 

(1) minimize f(x) = f(x,u(x)), 

where / is a real-valued function and u(x) is computed via the solution of a system 
of equations 


h(x, u(x)) = 0. 

Think of x as the design variables of the problem, and of u as the state variables de- 
scribing the physical state of the system to be optimized. The equations h(x, u(x)) = 0 
represent typically a system of ODE or PDE that govern the system. Given a value 
of the design variables x, we can solve these equations for the state u(x). In the com- 
putational setting, the design variables x and state variables u will each lie in some 
finite-dimensional vector space. 

More generally, of course, the problem could have additional constraints. How- 
ever, to present the points I wish to make I will consider only the original, simpler 
problem (1). 

I will assume that the state equations are a block system; two blocks will suffice 
for my purposes: 


h(x, Ml, u 2 ) 


h 1 (x, Ml, m 2 ) 
h 2 (x, Ml, m 2 ) 


In the context of Multidisciplinary Design Optimization, the blocks hi might describe 
the state equations for different disciplines, such as structures and aerodynamics, 
together with interdisciplinary consistency conditions, and the aggregate h describes 
the coupled multidisciplinary analysis system. See [5] for a further discussion of this 
point. 

I will further assume that given x and mi, we can solve 


h\(x, Ml, u 2 (x, Ml)) = 0 


for u 2 (x, Mi), and, vice versa, given x and m 2 , we can solve 


h 2 (x, ui(x, u 2 ), u 2 ) = 0 


for ui(x,u 2 ). This hypothesis simply reflects the independent solubility of the dis- 
ciplinary analyses. Note that this hypothesis means that the partition of the state 
variables corresponds to that of the state equations; i.e. , if there are rii equations in 
hi, then there would be rii state variables in m;, and the system solved for m; would 
be square. 

I will consider three possible formulations of the optimization problem (1). The 
first is the problem in its ostensibly unconstrained form; following [5], I will refer to 
this as the Multidisciplinary Feasible (MDF) formulation: 

minimize /mdf(x) = f(x,ui(x),u 2 (x)) 

(2) where h\(x, ui(x), u 2 (x)) = 0 

h 2 (x, ui(x), u 2 (x)) = 0. 
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At each iteration, we feed x into the multidisciplinary analysis system and solve for 
the state u. Only the design variables x are treated as independent variables. This is 
an example of a variable reduction approach, in which we use the equality constraints 
to define some of the variables as functions of the others, and eliminate them as 
independent variables. 

At the other extreme, we could view the state equations purely as equality con- 
straints, yielding the AU-at-Once (AAO) formulation: 

minimize fAAo(x, «i, u 2 ) = f(x, ui, u 2 ) 

(3) subject to h\(x, u\, u 2 ) = 0 

h 2 (x, «i, u 2 ) = 0. 

This approach has many different names in many different Helds. For instance, it 
is known as Simultaneous Analysis and Design (SAD or SAND) in structural opti- 
mization [11]. It is with this view of the state equations as equality constraints that 
“feasible” appears in the name “Multidisciplinary Feasible” ; at each iteration we are 
feasible with respect to the state constraints. The rationale for such a formulation is 
that rather than spend a lot of time attaining multidisciplinary feasibility far from 
an optimal solution, this constrained formulation allows us the freedom to attain 
feasibility only at the same time as optimality. 

Intermediate to these two formulations is one that, using John Dennis’s apt ter- 
minology, I will call the In-Between formulation. In this approach we eliminate one 
of the blocks of state variables via the state equations (I choose to eliminate u 2 ) to 
arrive at the following problem: 

minimize //s(*, u i) = f(x, u i, u 2 (x, u i)) 

(4) subject to hiB(x,ui) = hi(x,ui,u 2 (x,ui)) = 0 

where h 2 (x, u\, u 2 (x, Mi)) = 0. 

In the taxonomy presented in [5], the In-Between formulation corresponds to the Indi- 
vidual Discipline Feasible formulation. The consistency constraints of [5] correspond 
to h i in (4), while the individually feasible disciplines are subsumed in h 2 . This for- 
mulation shares with the AAO formulation the advantage of additional degrees of 
freedom afforded by the constraints, while having perhaps substantially fewer explicit 
variables in the optimization problem, since we have performed a partial variable re- 
duction by eliminating u 2 as an independent variable. For a further discussion of the 
relative advantages and disadvantages of these formulations, see [5]. 

The goal of this paper is to show how these three formulations, the associated 
sensitivities, and reduced basis methods for their solution are related. I will show how 
one may, in a fairly simple manner, pass between the different formulations together 
with algorithms for their solution. It will turn out that to apply a reduced basis 
algorithm to the three formulations requires, in principle, only a change to one step of 
the optimization algorithm. This is due to the structure of the problem introduced by 
the state constraints. This structure has important consequences for the design and 
implementation of disciplinary analysis codes if they are to be used in optimization. 

The work in [7] compares these three formulations for a model large-scale param- 
eter estimation problem and shows the potential promise of the In-Between formula- 
tion. The implementation used in that work realizes the idea presented here, and the 
three formulations were solved with substantially the same code, as will be discussed 
in §5. 
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In §2 I will review the notions of the reduced gradient and the reduced Hessian. 
In §3, I will examine the sensitivity calculations associated with the MDF and In- 
Between formulations. There I will show how they can be obtained from those of the 
AAO formulation. In §4 I will outline how a reduced basis nonlinear programming 
(NLP) algorithm works, and in §5 show how one may pass between reduced basis 
algorithms for the solution of the different formulations. 

2. The reduced gradient and Hessian. In this section I will review the no- 
tions of the reduced gradient and the reduced Hessian. First recall the classical implicit 
function theorem [4]. 

Theorem 2.1 (The Implicit Function Theorem). Suppose $ is a mapping 
from an open subset of R m x R n into R n . Suppose (yo,vo) is a point m R m x R n 
such that 

1. <F(t/ 0 , v 0 ) = 0/ 

2. $ is continuously differentiable at (y o,vo); 

3. the matrix ^-(y oMo) is invertible. 

Then there exists a neighborhood T of yo such that for each y £ T, the eguation 
<F(t/, v) = 0 is soluble for v(y) £ R n . Moreover, the derivative of this solution v(y) 
with respect to y is given by 

dv /9<F\ -1 9$ 

( ’ dy = ~ ) <V 

Formally, the latter formula is the result of implicit differentiation of <1 >(?/, v) = 0 
to obtain 

9$ 9$ dv 

dy dv dy 

and then (5). 

Now, suppose that v(y) is the unique solution to a state relation 
(6) $(y, v(y)) = 0 

where $ is twice continuously differentiable and 9 < F/9 l is invertible. In practice, 
the validity of these hypotheses typically follows from the existence and uniqueness 
theory for the solution of the equation represented by (6). With these hypotheses, I 
may apply the Implicit Function Theorem to compute sensitivities. Before doing so, 
let me first define the matrix 


W = W(y,v) = 


I 

9$-! 9$ 


I will call W the injection operator associated with $ since it is a one-to-one mapping 
from R m into R m x R n and is invertible on its range (i.e. , W has full column rank). 
Its transpose W T I will call the reduction operator. 

Let 


9$ 




dy 


and 


dv 


We call B the basic block since, by assumption, B = 9 < F/9l is invertible and hence is 
a basis of R n . The associated state variables v are the basic variables. The remaining 
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design variables, y, are the non-basic variables, and N the non-basic block. In terms 
of the basic and non-basic blocks, 


W ( -B^N ) ' 

Note that W is a basis for the nullspace of the linearized state constraint operator: 

( 7 ) V<S> t W=(N 5 )(_/ i # ) = 0 . 

I will return to this point in §4 and §5. 

Finally, given F : R m x R n — >■ R, define A by 

/< 9 $\ ~ T 

(8) \ = \(y,v) = - y—j (y,v)V v F(y,v) 

and the Lagrangian l(y, v, A) by 

£(y, v; A) = F(y, v) + A T $(t/, v). 

The quantity A is a Lagrange multiplier estimate arising from variable reduction [10]. 
For those familiar with the language of “the adjoint method” in control, A is the 
costate or adjoint state. 

Proposition 2.2. Let F(y) = F(y,v(yj), where v(y) is computed via (6), and 
F : R m x R n ->■ R^. Then 

(9) V y F T (y) = V ( y , v) F T (y, v(yj)W(y, v(yj). 

If F is a real-valued function, then 

(mJjF(y) = W T (y, v(y)) (v? yiV) F(y, v(y)) + v(y))\(y, v(y)j) W(y, v(y)). 


By ^^A I mean 


V (y,«) $A = E A '- V (y,«) $ '-- 
2 = 1 


In the case of a real- valued F , transposition of (9) tells us that V-F is given by 
(11) ^yF(y) = W T (y, v(y))V ( y tV) F(y, v(y)). 

The quantity appearing on the right-hand side of (11) is known as the reduced gradient 

[10], since it is the product of the reduction operator W T with VF. Similarly, the 
quantity on the right-hand side of (10) is called the reduced Hessian of the Lagrangian. 
Proof Computing the Jacobian of F, we see that 


dF 

dy 


(v) 


dF dF dii 
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This and the Implicit Function Theorem then yield 

f"* = $£<*•<*»-£<*•<*»(£) (>■>-(>»§(>■ <■(!>» 

= (§^» f <»■”(»))) <#.»(»)) 

which is (9). 

Next, the Hessian for real- valued F. We have 


d 2 F d L , /xx „ , . xx dv 


d 2 F dF y dv T dF y dF v dv dv T dF v dv d 2 v 

dy 2 dy ^ dy dv ^ dy dy ^ dy dv dy v dy 2 

We can rewrite this in terms of the reduction and injection matrices as 


/ T x / d_Fy_ m 

d F — | i ] dy dy 

dx 2 \ dy J dFy dF v 

\ dv dv 


= wT [K,) F ) w + F ^- 


Repeated implicit differentiation of <1 >(?/, v(y)) = 0 yields 


— $ y (j/, v(y)) + <S> v (y, v(y))— = 0 


whence 


3$\ 1 / d$ y dv T d$ y d$ v dv dv T d$ v dv\ 

dv J \ dy dy dv dy dy dy dv dy I 


d 2 v dF ( dh\ I d$ y dv 1 d$ y d$ v dv dv 1 d$ v dv 

v dy 2 dv \du J l dy dy dv dy dy dy dv dy 

t ( d$ y dv T d$ y d$ v dv dv T d<b v dv\ 

\ dy dy dv dy dy dy dv dy J 


= WT {K,v)^) W - 


Thus we finally arrive at 


0(!/) = {vfy,v)F + V(y, V ) hA ) W = wT (vf y , „/(!/, v(y); X (y, v(y))) W, 


which is (10). □ 
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3. Sensitivities for the three formulations. In this section I will derive some 
formulae relating the sensitivities of the MDF, In-Between, and All-at-Once formu- 
lations. You will see that the sensitivities are all derived from those of the AAO 
formulation in simple and interesting ways. 

Proposition 2.2 says that the gradient and Hessian associated with the MDF for- 
mulation are the reduced gradient and reduced Hessian of the Lagrangian associated 
with the AAO formulation, as we will now discuss. Let Baao and N aao denote the 
basic and non-basic blocks associated with the AAO formulation, 


( 

5*1 

5*i 

\ 


5«i 

du 2 



to 

to 



dui 

du 2 

) 


Naao — 



let Waao denote the reduction matrix associated with the AAO formulation, 


WaA ° ( S^aoNaao ) ’ 

let A aao denote the Lagrange multiplier associated with the AAO formulation, 

A aao = —Bj^qVu/aao, 

and let Iaao denote the Lagrangian associated with the AAO formulation, 

£aao(x, u; A) = f A Ao(x, u) + A T h(x, u). 

Then we have this corollary of Proposition 2.2. 

Corollary 3.1. Suppose that (x,u) is feasible with respect to the multidisci- 
plinary analysis constraints: h(x,u) = 0. Then the gradients and Hessians at (x,u) 
of the MDF and AAO formulations are related by 

(12) V xfMDF(x) = Waao{ x , u W (x,u)fAAo{x,u) 

(13) — AAO (^ , ^) ^{x,u)£ AAO (^ , ^ , ^AAO (^ , ^)) 

and 

(14) Vlf M DF(x) = W T (x, u) {^l XtU) iAAo{x, u; A A ao(x, u))) W A ao(x, u). 


Proof The relations (12) and (14) follow immediately from Proposition 2.2. The 
equation (13) is a consequence of the fact that W AAO Nh = 0. □ 

Next we turn attention to the sensitivities associated with the In-Between formu- 
lation (4). We will denote by S the Schur complement 

_ dhi dhi ( dh 2 \~ 1 dh 2 
1 j ~ d Ul du 2 \du 2 ) dui 


This is the Schur complement of the basic block Baao upon elimination of the variable 
u 2 . You can easily check that 


r>— 1 

n AAO ~ 

/ 

(16) 

V " 


S '- 1 

5*2 V 1 5*2 c 1 

du 2 j dui 


-q- 1 — ( 1 ^ 

du 2 \du 2 j 

dhA' 1 f ^V 1 5*2 „_i 5*i / 5*2 V 1 
du 2 j \du 2 ) dui du 2 \du 2 ) ) 
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I will also denote by W 2 the reduction matrix associated with the elimination of u 2 : 


0 \ 

-fui 

( -1 

\du 2 J du 2 ) 

The next result concerns the basic and non-basic blocks for the In-Between for- 
mulation. Equations (15) and (19) mean that the basic block associated with the 
In-Between formulation (4) is the Schur complement that results if u 2 is eliminated 
from the basic block of the All-at-Once formulation. 

Proposition 3.2. The nonbasic block associated with the In-Between formulation 

(A) ls 

dh IB _ dhi dhi ( dh^A -1 dh 2 

dx dx du 2 \du 2 ) dx 


(17) 


W 2 = 


I x 

0 


dh 2 

du 2 


dh 2 

dx 


The basic block associated with the In-Between formulation (4) is 
dhjs 


(19) 


du-\ 


dhi 

dui 


dhi 

du 2 


dh 2 

du 2 


-1 


dh 2 

dui ' 


Proof Applying Proposition 2.2 with y = (x,u 1 ), v = u 2 , and F(x,ui) = 
hiB{x,ui,u 2 (x,ui)) yields 


f dhjs dhjs 
\ dx dui 


dhi dhi dhA 

dx dui du 2 j 2 


/ 

/ dhi dhi dhi \ 

\ dx dui du 2 J 

V 

from which we obtain (18) and (19). 
Let 


I x 

0 

f dh 2 \ _1 dh 2 

\du 2 J dx 


0 \ 

A 1 

dh 2 \ _1 dh 2 ’ 

du 2 J dui ) 


(20) 1 Vib — i f dhjs \ 1 dhis J 

\ \ dui J dx ) 


denote the reduction matrix associated with eliminating ui from the In-Between for- 
mulation. Let 


(21) 


^IB 


dhjs 

dui 


-T 

V Ul flB 


denote the Lagrange multiplier associated with the In-Between formulation. In the 
In-Between formulation, we eliminate u 2 from the AAO formulation as an explicit 
variable; the following proposition says that the the multiplier A jb can be obtained 
from the AAO multiplier by dropping the component corresponding to the constraints 
that define u 2 . 
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Proposition 3.3. At a point (x,u i) feasible with respect to his, i.e., satisfying 
hig(x, u i) = 0 ( and a fortiori, h 2 (x, u u 2 (x, u i) = §), we have 

(22) \i B {x,ui) = (I Ul 0) \ A ao{x,ui,u 2 ). 


Proof Applying Proposition 2.2 to compute ^( XtUl )fiB(x,u i) we have 

A IB = -S~ T V Ul fiB(x,ui) 

— S (0 ) ^(x,ui)flB (t, III) 

— S (0 ^tii) ^2 ^( X) u i ,U2 )fAAO (t , Ui j H 2 ) • 


Writing this out leads to 

ib = —S~ T ( 0 I Ul ) 
= - (0 S~' 


X 


l i n _^ 2 T ^V T \ 

* dx \du 2 j 

n T -Uhl (Uhl ~ T 

Ul du\ V3m 2 


1 ,U 2 ) IaAO 7 ^ 1 ; ^ 2 ) 


) 


_ 9/72 /" <9/72 ^ \ _ f / \ 

$ q u ^ \dlL 2 ) J ^(x,u 1 ,u 2 )fAAo{x,Ui,U2) 

_ (/«! ^AAO ^0 0 I ^ ^ (x ) u 1) u 2 )fAAo{% 1 u l-> ^ 2 ) 


^) ^li ^ 2 ) 

— (An 0) ^AAO- 


□ 

We have the following factorization of the All-at-Once reduction operator in terms 
of the In-Between reduction operator and the reduction operator associated with the 
elimination of u 2 as an explicit variable. 

Proposition 3.4. Suppose that (x,u 1) is feasible with respect to his, as in 
Proposition 3.3. Then 


(23) 


Waao — W 2 Wib- 


Proof. The proof is another icky calculation. We have 


and 



/ I x 
0 

( dh 2 \ -1 dh 2 

\ \du 2 ) dx 


0 \ 

lu 1 

dh 2 \ _1 dh 2 ’ 

du 2 ) du 2 ) 
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so 


W 2 W IB = 


v 


dh 2 

du 2 



But from (16) we see that the last two rows of this product are none other than 


~ b aao n aao - 

l 


S~ 


-s 


dh 2 

du 2 


-l 


Uhl a- 1 

5«i 


dh 2 

du 2 


-1 


_i dhi 
du 2 

dh^ ~ 1 


dh 2 

du 2 


-l 


dh 2 ! dhi 
du 2 J dui du 2 


dh 2 

du 2 


-l 


/ 

[ dx 

1 dhi 

dx 


V 


Thus Waao = D 

The next proposition says that the reduced gradient and reduced Hessian for 
the In-Between and All-at-Once formulations are the same at appropriately feasible 
points. 

Proposition 3.5. Suppose that (x,u i) is feasible with respect to his, as in 
Proposition 3.3. Then 

(24) EE/ ^ ^ (x ,u i) flB (x , I/i) — ^^AAO ^ (x ,u i ,u 2 ) /aAO (t , U\ , 1/2) 7 

(25) ^^IB ^ (x ,u 1 )^ IB (t , Ui j ^IB ) — ^^AAO ^ (x ,u 1 ,u 2 )^AAO (t , I/i , 1/2 7 ^lio) 7 

(26) V( :C)tli) f/ B (T, Mi; A/ S ) = 1 T 2 t «i, «25 A^to)) EEt 

and 


(271)1 / p B V( :C)tli) f/ B (T, «i; A/ B )1T/ B = W / J AO V( :C)tli)tl2) fATo(T, «i, «25 Aato)W / ato, 

where A/ S = A ib(x, «i) and \aao = A^^ 0 (*, «i, u 2 ). 

Proof. From Proposition 2.2 we have 

^ (x ,Ul)f IB V 1 u 1 ,u 2 )J AAO 1 

this combined with Proposition 3.4 yields (24). Equation (25) then follows since 
Wf B S7h IB = 0 and Wj AO Vh = 0. 

Applying Proposition 2.2 to F = £ib we obtain 

^tx, Ul /lB = W 2 (yf XtUltU2 )fAAO + ^\ x ,u 1 ,u 2 ) h ^IB + ^\x,u u u 2 ) h 2 A 2 ) W 2 


where A 2 is given by 


( dh \ 

^2 — ( ) (0 ^2) (x ,ui ,U2)f AAO + ,u 1 ,u 2 ) ^IB ) • 
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Using (16) and (21), we obtain 



— (0 I U2 ) ( B AA o* 7 u f aao) 
= (0 Iu 2 )^aao- 


At this point the component of A aao corresponding to the block h 2 reappears. 
Moreover, (22) tells us that A jb = (An OjA^o- Thus, 

= W 2 (V^, ui , ua )/AA 0 + ^\x,u 1 ,u 2 ) hX AAo) W 2 , 

which is (26). Moreover, by Proposition 3.4, 

^I T B^( x ,ui/lBW IB = Wj B W? {y 2 ( XiUliU2 )f AAO +^f XtUltU2 )h^AAO^ W 2 W 1 B 

= WJao^Uu^aaoWaao. 

□ 


4. Reduced basis algorithms. In this section I will discuss a model Sequential 
Quadratic Programming (SQP) subproblem and reduced basts methods for its solution. 
SQP subproblems similar to this one are at the heart of many equality constrained 
NLP algorithms. 

The fundamental SQP subproblem for (3) is 

, . minimize ^ s T Hs + g T s 

' ' subject to Vh T s + h = 0, 

where H is an approximation to the Hessian of the Lagrangian and g is the gradient of 
the Lagrangian. For a trust-region algorithm one would have, in addition, some man- 
ner of trust-region constraint, but for now I will ignore this detail. This subproblem 
may be stated in many equivalent ways; for a discussion, see [15]. 

Now, if one had a point s LF that was feasible with respect to the linearized 
constraints, and a basis Z for the nullspace of the Jacobian of the constraints, then 
one could express s as 

s = s LF + Zcr 


and the SQP subproblem could be reduced to an unconstrained problem: 
(29) minimize ^cr T Z T H Zcr + (g + H s LF ) T Z T a. 


In a reduced basts approach to an equality constrained optimization problem we 
take advantage of a certain form of basis for the nullspace of the linearized constraints 
[10]. At each iteration of the optimization algorithm, we seek a partition of the 
Jacobian of the constraints of the form 


where B is invertible; then 


V/i T = (N B) 


Z = 
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is a basis for the nullspace of V/i T . 

But we have already seen an example of such a nullspace basis at work. In the 
case of state constraints, we have an immediate and fixed choice for the basis matrix 
B, to wit, the linearized state operator. This is the import of (7). Moreover, the 
invertibility of B allows us to obtain a linearly feasible point s LF ; we may simply take 

<*» * I '=( -*-»)■ 

This corresponds to solving the linearized state equation, or, if one prefers, taking one 
unglobalized step of Newton’s method towards the solution of the state equation. 

In terms of W, then, the model SQP subproblem (28) could be transformed to 
the unconstrained problem 

minimize ^a T W T HWa + (g + H s LF ) T W T a. 

One would solve this problem for the reduced step a — a step in the design variables 
alone — and then take the step s = W cr in all the degrees of freedom in the optimization 
problem, both the design variables and those state variables kept explicit. 

Note that this is not only a convenient reduction, but in the context of MDO this 
is almost surely a necessary step. The SQP subproblem involves attaining feasibility 
with respect to the linearized state constraints. Therefore the SQP subproblem is nec- 
essarily more difficult than solving the linearized multidisciplinary analysis problem, 
which, for problems involving PDE, may be quite difficult in itself. In order to solve 
the optimization subproblem we must exploit the structure of the problem — in this 
case, its relation to the state constraints — and avail ourselves of the pieces available 
from the solution of the analysis problem. 

5. A simple way of implementing reduced basis algorithms for the dif- 
ferent formulations. In this section I will show how the relations between the dif- 
ferent formulations in §3 mean that, in principle, it is possible to implement an opti- 
mization algorithm for the AAO formulation that with a single modification becomes 
an algorithm for either the MDF or In-Between formulations. I have actually done 
such an implementation for the work described in [7]. 

The key to such an implementation lies in the relations that were presented in 
§3. To make matters concrete, I will discuss the following trust-region algorithm for 
large-scale equality constrained optimization problems. This algorithm was used for 
the work in [7] and is a relative of the algorithms described in [3] and [6]. I use 
the subscript and superscript “c” to denote quantities associated with the current 
iterate, and the subscript and superscript “+” to denote quantities associated with 
the putative next iterate, following [8]. 

At each iteration, we approximately solve the subproblem 

minimize ^s T Hs + g T s 

(31) subject to || V/i T s + h c || < 6 

IMI<* 

where H is an approximation to the Hessian of the Lagrangian, g is the gradient of 
the Lagrangian, 6 is the current trust radius, and 9 is chosen to enforce Fraction of 
Cauchy Decrease on || V/i T s + h c || over a trust-region of radius rS, where 0 < r < 1 
is fixed. 
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The approximate solution of the subproblem is effected in two stages. The first 
is to take a step in the basic variables to improve linear feasibility, rather as in (30): 


(32) 

where 


d = 


The quantity a is chosen to solve 


„LF 


= ad, 


0 


- ^ h 


-1 


minimize A aVh T d+h c 


subject to 


ad II < r6. 


The second stage is then to improve optimality subject to the constraint of not 
degrading the improved linear feasibility achieved by s LF . Let 


0 = Vh T .s LF + h c . 


The subproblem we solve to improve optimality subject to improved linear feasibility 
is 

minimize ^s T Hs + g T s 
(33) subject to V/i T s + /i c = 0 

This we do by using the reduced basis and writing the desired step as 

s = s LF + W a 


and solving the trust-region problem 

, . minimize ^a T W T H W cr + (g + Hs LF ) T W T a 

' ' subject to || s LF + Wa || <6. 

We then test the aggregate step s using the augmented Lagrangian 
L(y, A, p) = f(y ) + A T h(y) + | || h(y) || 2 , 

as the merit function with the penalty weight p updated as described in [9]. Figure 1 
contains a rough outline of the algorithm for the AAO formulation. 

Now suppose that I modify this algorithm by adding an analysis step in which 
given design variables x c , I solve the multidisciplinary analysis problem h(x c , u c ) = 0. 
This single modification, illustrated in Figure 2, yields a trust region algorithm for 
the MDF formulation. 

Theorem 5.1. The modified algorithm in Figure 2 is eguivalent to a trust region 
algorithm for the MDF formulation. 

Proof I will check the following: 

1. At every step, we are feasible with respect to the constraint h(x c ,u c ) = 0. 
This is, of course, true by design. 
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Initialization: Choose an initial (x c ,u c ). 
Until convergence, do { 

1. Compute the multiplier 


^aao — 


dh 

du 


-l 

^u/aao- 


2. Test for convergence. 

3. Construct a local model of Iaao about ( x c , u c ). 

4. Take a step s LF to improve linear feasibility: 



5. Subject to the improved linear feasibility, take a step to improve 
optimality: 

minimize \ o- T Wj AO H AAO W AAO (T + ( g A AO + H AAO s LF ) T Wj AO a 
subject to || s LF + W AA o<x || < 8. 

6. Set s = (s x ,s u ) = s LF + W AAO cr. 

7. Evaluate (*+,«+) = (x c , u c ) + (s x ,s u ). 

8. Update (x c ,u c ), 8, &c. 


Fig. 1. Reduced basis algorithm for the ^4^40 formulation. 


2. The subproblems we solve in the context of the AAO formulation correspond 
to a trust region subproblem for the MDF formulation. 

3. The merit function we use to evaluate the step taken for the AAO formulation 
corresponds to the correct merit function for the MDF formulation. 

Because we solve the analysis problem h(x c , u c ) = 0 at each iteration, h c = 0 and 
so s LF = 0. The subproblem we solve to improve optimality then becomes 

minimize ^o- T Wj AO H AAO W AAO o- + 9 AA o W AAO a 
subject to || W AA ocr || < 8. 

However, in light of Corollary 3.1, this is none other than 

minimize \(T T VlfMDF^ + V x fl IDF a 
subject to || W AA ocr || < 8. 

This we recognize as a trust-region problem for the MDF formulation, which estab- 
lishes Point 2. Note, however, the non-standard scaling of the trust region constraint. 
As discussed in [14], this scaling is quite natural since it measures the effect of changes 
in the design variables on the state variables. 

Moreover, Point 3 holds. Because h(x,u) = 0 at each step, the augmented La- 
grangian merit function reduces to 

f A Ao(x, u(xj) + \ AAO h(x, u(xj) + ^ || h(x, u(xj) || 2 = f AAO (x, u(xj) = f M DF {x ) , 
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Initialization: Choose an initial x c . 

Until convergence, do { 

0. Analysis: Solve h(x,u(x)) = 0 for the state u(x). 

1. Test for convergence. 


7. Analysis: Solve h(x + ,u + ) = 0 for u + (x + ) and evaluate 
(x + ,u + ). 


} 


Fig. 2 . Reduced basis algorithm for the AAO formulation with a multidisciplinary analysis step. 


Initialization: Choose an initial (T c ,mJ). 

Until convergence, do { 

0. Analysis: Solve h'px, u i, U 2 (x, u i)) = 0 for U 2 (x, u i). 

1. Test for convergence. 


7. Analysis: Solve h\(x + , uf, u^) = 0 for (x + ,uf ) and 
evaluate (x + , «+)• 


} 


Fig. 3 . Reduced basis algorithm for the AAO formulation with an analysis step that eliminates 112 • 


which is the merit function for the unconstrained MDF formulation. Thus we have a 
trust region algorithm for the MDF formulation. □ 

Thus, simply by the addition of an analysis step the reduced basis algorithm for 
the AAO formulation becomes a reduced basis algorithm for the MDF formulation, 
without any re-implementation of sensitivities or other procedures. In fact, this re- 
duced basis algorithm is an instance of a class of algorithms known as Generalized 
Reduced Gradient methods, which date back to Abadie and Carpentier in the early 
1960s. For further discussion and illustration of these methods applied to problems 
with state constraints, see [1, 2, 12, 13, 14], 

Now, what happens if I add a partial analysis step to the algorithm for the 
AAO formulation, an analysis that enforces feasibility with respect to the block of 
constraints / 12 , as in Figure 3? As it happens — as it was meant to happen — we obtain 
a reduced basis algorithm for the In-Between formulation. 

Theorem 5.2. The modified algorithm in Figure 3 is eguivalent to a reduced 
basis trust region algorithm for the In-Between formulation. 
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Proof. I will check the following: 

1. At every step, we are feasible with respect to the In-Between state constraint 
h 2 (x, u i, u 2 (x, «i)) = 0. This is true by design. 

2. The subproblem we solve in the context of the A AO formulation to improve 
model feasibility yields a similar step for the In-Between formulation. 

3. The subproblem we solve in the context of the A AO formulation to improve 
optimality subject to improved model feasibility does the same for the In- 
Between formulation. 

4. The merit function we use to evaluate the AAO formulation corresponds to 
the correct merit function for the In-Between formulation. 

We have 


BaAO^c 



dh 2 

du 2 


S -1 /n 


Bib hiB 


and hence 


h AAO 


0 


—aB 


W 2 s lf 


AAO 

h 

0 

dh 2 

du 2 


h 


-l 


dh 2 

dx 


IB ; 


where a is chosen so that 


-S 


_i dhi 
du 2 

-l 


dh 2 

du 2 


dh 2 

du 2 


dh ^ s -i 


du 


0 


In 


dh 2 

du 2 


-l 


\ 

dh 2 

dx ) 


h lf 

S IB 


= II W 2 s F b || = rb- 


lw 2 


du 2 


dh 2 
du 2 


0 

s LF 

s IB 


This settles Point 2. Again, note the non-standard scaling of the trust-region; this 
tracks the effect of the step on the implicit variable u 2 . 

Meanwhile, by Proposition 3.5, 


and 


Waao 

WaaoB aaoW aao 
w aao9ib 


W- 2 W IB , 
Wj B H IB W IB , 
WJbQib, 


WJaqHaaoSaao 


W f b W f H A aoW 2 s ff 
WJ b Bibs F b ■ 
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Thus we have the equivalence between the subproblem 

minimize \ o- T Wj AO H AAO W AAO o- + H aao sYao + 9 AA o a 

subject to || s AAO + W AA o<y || < 6 

for the AAO formulation and the subproblem 

minimize \(T T Wj B H IB W IB (T + H IB s AAO + gJ B a 
subject to || s f b + Wi B cr <6 

for the In-Between formulation, which establishes Point 3. 

Moreover, because /^(A, u\, U 2 (x, ui)) = 0 at each step, Proposition 3.3 tells us 
that the augmented Lagrangian merit function reduces to 

f A Ao(x, u(xj) + \ AAO h(x, u(xj) + | || h(x, u(xj) || 2 = 

fis(x, «i) + A J B h IB (x, Mi) + | || h IB (x, Mi) || 2 , 

which is the augmented Lagrangian for the In-Between formulation, confirming Point 
4. Thus we have a reduced basis trust region algorithm for the In-Between formulation. 
□ 

I have shown that the addition of an appropriate analysis step to the reduced 
basis algorithm in Figure 1 for the AAO formulation yields reduced basis algorithms 
for the MDF and In-Between formulations. Similar results hold for other algorithms 
one might consider, such as an SQP approach using a line-search. 

6. Conclusion. The results of §3 and §5 follow from structural features of opti- 
mization problems governed by state constraints. A number of other practical conse- 
quences of these structural features are discussed in [14]. The results presented here 
illustrate, I believe, some of the rich and useful structure of optimization problems 
with state constraints. 

I have shown that a properly structured and implemented reduced basis algorithm 
can be modified in a conceptually and practically simple manner to produce algorithms 
for alternative problem formulations, demonstrating a truly gratifying connection be- 
tween the formulation of a problem and algorithms for its solution. One can easily 
build a telescoping family of algorithms that correspond to different formulations of 
the problem, and have different computational behavior. 

The key to this approach is the availability of certain components inside the state 
constraints; in particular, at the very least the actions on vectors of the operators 

dhj dhj fdhiX -1 ( dhj \ ~ T 
dx ’ du ’ \ du ) ’ \du ) 

If simulation codes were written to make these components accessible to those of 
us doing optimization, then our task would be greatly facilitated, and alternative 
formulations and algorithms could be easily explored. 
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